NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SPIN: sex-specific and pathway-based interpretable neural network for sexual dimorphism analysis

https://doi.org/10.1093/bib/bbae239

Ko, Euiseong; Kim, Youngsoon; Shokoohi, Farhad; Mersha, Tesfaye B; Kang, Mingon (May 2024, Briefings in Bioinformatics)

Abstract Sexual dimorphism in prevalence, severity and genetic susceptibility exists for most common diseases. However, most genetic and clinical outcome studies are designed in sex-combined framework considering sex as a covariate. Few sex-specific studies have analyzed males and females separately, which failed to identify gene-by-sex interaction. Here, we propose a novel unified biologically interpretable deep learning-based framework (named SPIN) for sexual dimorphism analysis. We demonstrate that SPIN significantly improved the C-index up to 23.6% in TCGA cancer datasets, and it was further validated using asthma datasets. In addition, SPIN identifies sex-specific and -shared risk loci that are often missed in previous sex-combined/-separate analysis. We also show that SPIN is interpretable for explaining how biological pathways contribute to sexual dimorphism and improve risk prediction in an individual level, which can result in the development of precision medicine tailored to a specific individual’s characteristics.
more » « less
Full Text Available
Multi-layered self-attention mechanism for weakly supervised semantic segmentation

https://doi.org/10.1016/j.cviu.2023.103886

Yaganapu, Avinash; Kang, Mingon (February 2024, Computer Vision and Image Understanding)

Weakly Supervised Semantic Segmentation (WSSS) provides efficient solutions for semantic image segmentation using image-level annotations. WSSS requires no pixel-level labeling that Fully Supervised Semantic Segmentation (FSSS) does, which is time-consuming and label-intensive. Most WSSS approaches have leveraged Class Activation Maps (CAM) or Self-Attention (SA) to generate pseudo pixel-level annotations to perform semantic segmentation tasks coupled with fully supervised approaches (e.g., Fully Convolutional Network). However, those approaches often provides incomplete supervision that mainly includes discriminative regions from the last convolutional layer. They may fail to capture regions of low- or intermediate-level features that may not be present in the last convolutional layer. To address the issue, we proposed a novel Multi-layered Self-Attention (Multi-SA) method that applies a self-attention module to multiple convolutional layers, and then stack feature maps from the self-attention layers to generate pseudo pixel-level annotations. We demonstrated that integrated feature maps from multiple self-attention layers produce higher coverage in semantic segmentation than using only the last convolutional layer through intensive experiments using standard benchmark datasets.
more » « less
Full Text Available
Evidential deep learning for trustworthy prediction of enzyme commission number

https://doi.org/10.1093/bib/bbad401

Han, So-Ra; Park, Mingyu; Kosaraju, Sai; Lee, JeungMin; Lee, Hyun; Lee, Jun Hyuck; Oh, Tae-Jin; Kang, Mingon (November 2023, Briefings in Bioinformatics)

Abstract The rapid growth of uncharacterized enzymes and their functional diversity urge accurate and trustworthy computational functional annotation tools. However, current state-of-the-art models lack trustworthiness on the prediction of the multilabel classification problem with thousands of classes. Here, we demonstrate that a novel evidential deep learning model (named ECPICK) makes trustworthy predictions of enzyme commission (EC) numbers with data-driven domain-relevant evidence, which results in significantly enhanced predictive power and the capability to discover potential new motif sites. ECPICK learns complex sequential patterns of amino acids and their hierarchical structures from 20 million enzyme data. ECPICK identifies significant amino acids that contribute to the prediction without multiple sequence alignment. Our intensive assessment showed not only outstanding enhancement of predictive performance on the largest databases of Uniprot, Protein Data Bank (PDB) and Kyoto Encyclopedia of Genes and Genomes (KEGG), but also a capability to discover new motif sites in microorganisms. ECPICK is a reliable EC number prediction tool to identify protein functions of an increasing number of uncharacterized enzymes.
more » « less
Full Text Available
PGNets: planet mass prediction using convolutional neural networks for radio continuum observations of protoplanetary discs

https://doi.org/10.1093/mnras/stab3502

Zhang, Shangjia; Zhu, Zhaohuan; Kang, Mingon (December 2021, Monthly Notices of the Royal Astronomical Society)

ABSTRACT We developed convolutional neural networks (CNNs) to rapidly and directly infer the planet mass from radio dust continuum images. Substructures induced by young planets in protoplanetary discs can be used to infer the potential young planets’ properties. Hydrodynamical simulations have been used to study the relationships between the planet’s properties and these disc features. However, these attempts either fine-tuned numerical simulations to fit one protoplanetary disc at a time, which was time consuming, or azimuthally averaged simulation results to derive some linear relationships between the gap width/depth and the planet mass, which lost information on asymmetric features in discs. To cope with these disadvantages, we developed Planet Gap neural Networks (PGNets) to infer the planet mass from two-dimensional images. We first fit the gridded data in Zhang et al. as a classification problem. Then, we quadrupled the data set by running additional simulations with near-randomly sampled parameters, and derived the planet mass and disc viscosity together as a regression problem. The classification approach can reach an accuracy of 92 per cent, whereas the regression approach can reach 1σ as 0.16 dex for planet mass and 0.23 dex for disc viscosity. We can reproduce the degeneracy scaling α ∝ $$M_\mathrm{ p}^3$$ found in the linear fitting method, which means that the CNN method can even be used to find degeneracy relationship. The gradient-weighted class activation mapping effectively confirms that PGNets use proper disc features to constrain the planet mass. We provide programs for PGNets and the traditional fitting method from Zhang et al., and discuss each method’s advantages and disadvantages.
more » « less
Texture-based Deep Learning for Effective Histopathological Cancer Image Classification

https://doi.org/10.1109/BIBM47256.2019.8983226

Tsaku, Nelson Zange; Kosaraju, Sai Chandra; Aqila, Tasmia; Masum, Mohammad; Song, Dae Hyun; Mondal, Ananda M.; Koh, Hyun Min; Kang, Mingon (November 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM))

Automatic histopathological Whole Slide Image (WSI) analysis for cancer classification has been highlighted along with the advancements in microscopic imaging techniques, since manual examination and diagnosis with WSIs are time- and cost-consuming. Recently, deep convolutional neural networks have succeeded in histopathological image analysis. However, despite the success of the development, there are still opportunities for further enhancements. In this paper, we propose a novel cancer texture-based deep neural network (CAT-Net) that learns scalable morphological features from histopathological WSIs. The innovation of CAT-Net is twofold: (1) capturing invariant spatial patterns by dilated convolutional layers and (2) improving predictive performance while reducing model complexity. Moreover, CAT-Net can provide discriminative morphological (texture) patterns formed on cancerous regions of histopathological images comparing to normal regions. We elucidated how our proposed method, CAT-Net, captures morphological patterns of interest in hierarchical levels in the model. The proposed method out-performed the current state-of-the-art benchmark methods on accuracy, precision, recall, and F1 score.
more » « less
Full Text Available

Search for: All records